logo logo International Journal of Educational Methodology

IJEM is a leading, peer-reviewed, open access, research journal that provides an online forum for studies in education, by and for scholars and practitioners, worldwide.

Subscribe to

Receive Email Alerts

for special events, calls for papers, and professional development opportunities.

Subscribe

Publisher (HQ)

RHAPSODE LTD
Eurasian Society of Educational Research
College House, 2nd Floor 17 King Edwards Road, Ruislip, London, UK. HA4 7AE
RHAPSODE LTD
Headquarters
College House, 2nd Floor 17 King Edwards Road, Ruislip, London, UK. HA4 7AE

' Somers D' Search Results



...

Pearson product–moment correlation coefficient between item g and test score X, known as item–test or item–total correlation (Rit), and item–rest correlation (Rir) are two of the most used classical estimators for item discrimination power (IDP). Both Rit and Rir underestimate IDP caused by the mismatch of the scales of the item and the score. Underestimation of IDP may be drastic when the difficulty level of the item is extreme. Based on a simulation, in a binary dataset, a good alternative for Rit and Rir could be the Somers’ D: it reaches the ultimate values +1 and –1, it underestimates IDP remarkably less than Rit and Rir, and, being a robust statistic, it is more stable against the changes in the data structure. Somers’ D has, however, one major disadvantage in a polytomous case: it tends to underestimate the magnitude of the association of item and score more than Rit does when the item scale has four categories or more.

description Abstract
visibility View cloud_download PDF
10.12973/ijem.6.1.207
Pages: 207‒221
cloud_download 1096
visibility 1379
16
Article Metrics
Views
1096
Download
1379
Citations
Crossref
16

Scopus

...

Kelley’s Discrimination Index (DI) is a simple and robust, classical non-parametric short-cut to estimate the item discrimination power (IDP) in the practical educational settings. Unlike item–total correlation, DI can reach the ultimate values of +1 and ‒1, and it is stable against the outliers. Because of the computational easiness, DI is specifically suitable for the rough estimation where the sophisticated tools for item analysis such as IRT modelling are not available as is usual, for example, in the classroom testing. Unlike most of the other traditional indices for IDP, DI uses only the extreme cases of the ordered dataset in the estimation. One deficiency of DI is that it suits only for dichotomous datasets. This article generalizes DI to allow polytomous dataset and flexible cut-offs for selecting the extreme cases. A new algorithm based on the concept of the characteristic vector of the item is introduced to compute the generalized DI (GDI). A new visual method for item analysis, the cut-off curve, is introduced based on the procedure called exhaustive splitting.

description Abstract
visibility View cloud_download PDF
10.12973/ijem.6.2.237
Pages: 237 - 258
cloud_download 836
visibility 1025
6
Article Metrics
Views
836
Download
1025
Citations
Crossref
6

Scopus

...

A new index of item discrimination power (IDP), dimension-corrected Somers’ D (D2) is proposed. Somers’ D is one of the superior alternatives for item–total- (Rit) and item–rest correlation (Rir) in reflecting the real IDP with items with scales 0/1 and 0/1/2, that is, up to three categories. D also reaches the extreme value +1 and ‒1 correctly while Rit and Rir cannot reach the ultimate values in the real-life testing settings. However, when the item has four categories or more, Somers’ D underestimates IDP more than Pearson correlation. A simple correction to Somers’ D in the polytomous case seems to lead to be effective in item analysis settings.  In the simulation with real-life items, D2 showed very few cases of obvious underestimation and practically no cases of obvious overestimation. With certain restrictions discussed in the article, D2 seems to be a good alternative for these classic estimators not only with dichotomous items but also with the polytomous ones. In general, the magnitudes of the estimates by D2 are higher than those by Rit, Rir, and polychoric correlation and they seem to be close of those of bi- and polyserial correlation coefficients without out-of-range values.

description Abstract
visibility View cloud_download PDF
10.12973/ijem.6.2.297
Pages: 297-317
cloud_download 369
visibility 830
8
Article Metrics
Views
369
Download
830
Citations
Crossref
8

Scopus

...

Time management for educational leaders has remained highly relevant to scholars, policymakers and practitioners. We analyzed survey responses from 98 public high school principals to examine the congruency between average total hours they worked per week against the sum total of the average hours worked per week in each of five distinct categories of leadership tasks. The observed congruence was 0.32, while Cohen’s kappa coefficient was 0.10. Female principals tended to underreport, and male principals tended to overreport, total work time. Principals with doctorate degrees exhibited higher congruence than those without, and overreporting was inversely related to highest degree. Principals in charge of large teaching staffs were more likely than their counterparts to be congruent and less likely to overreport total work time. Self-report appears to be an inaccurate method to measure time use among high school principals. If time use is a key component of the quality of principal leadership, more detailed and robust techniques for collecting time use data should be utilized in future studies.

description Abstract
visibility View cloud_download PDF
10.12973/ijem.7.1.53
Pages: 53-65
cloud_download 404
visibility 638
0
Article Metrics
Views
404
Download
638
Citations
Crossref
0

Scopus

...

Although Goodman–Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings.  Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because it strictly indicates the probability to get a correct answer in the test item given the score, and it accurately produces perfect latent association irrespective of distributions, degrees of freedom, number of tied pairs and tied values in the variables, or the difficulty levels in the items. However, it underestimates the association in an obvious manner when the number of categories in the item is more than four. Towards this, a dimension-corrected G (G2) is proposed and its characteristics are studied. Both G and G2 appear to be promising alternatives in measurement modelling settings, G with binary items and G2 with binary, polytomous and mixed datasets.

description Abstract
visibility View cloud_download PDF
10.12973/ijem.7.1.95
Pages: 95-118
cloud_download 874
visibility 839
9
Article Metrics
Views
874
Download
839
Citations
Crossref
9

Scopus

...